UNCLASSIFIED 


Defense  Technical  Information  Center 
Compilation  Part  Notice 

ADPO 14040 

TITLE:  Space-Time  Detection  Theory 

DISTRIBUTION:  Approved  for  public  release,  distribution  unlimited 
Availability:  Hard  copy  only. 


This  paper  is  part  of  the  following  report: 

TITLE:  Military  Application  of  Space-Time  Adaptive  Processing  [Les 
applications  militaires  du  traitement  adaptatif  espace-temps] 

To  order  the  complete  compilation  report,  use:  ADA415645 

The  component  part  is  provided  here  to  allow  users  access  to  individually  authored  sections 
of  proceedings,  annals,  symposia,  etc.  However,  the  component  should  be  considered  within 
the  context  of  the  overall  compilation  report  and  not  as  a stand-alone  technical  report. 

The  following  component  part  numbers  comprise  the  compilation  report: 

ADPO  14040  thru  ADPO  14047 


UNCLASSIFIED 


1-1 


Space-Time  Detection  Theory 

William  L.  Melvin 

Georgia  Tech  Research  Institute 
7220  Richardson  Road,  Smyrna,  GA  30080,  USA 

bill.melvin@gtri.gatech.edu 

Summary 

Radar  systems  must  detect  targets  in  the  presence  of  clutter  and  jamming  (colored  noise)  signals.  By 
exploiting  signal  diversity,  space-time  adaptive  processing  (STAP)  improves  radar  detection  performance  in 
colored  noise-limited  environments.  This  set  of  lecture  notes  describes  fundamental  aspects  of  space-time 
detection  theory  in  Gaussian  noise.  We  first  describe  basic  detection  theory;  key  aspects  of  the  discussion 
include  the  formation  of  a sufficient  statistic,  threshold  setting,  matched  filtering,  and  the  notion  of  a whitening 
filter.  We  then  examine  the  role  space-time  signal  diversity  plays  in  enhancing  target  detection  performance  in 
colored  noise  environments.  Next,  we  define  commonly  used  performance  metrics,  including  signal-to- 
interference-plus-noise  ratio  (SINR)  loss  factors.  The  latter  part  of  the  paper  describes  practical  aspects  of 
space-time  detection:  we  consider  various  detector  structures,  estimation  of  unknown  parameters,  and  the 
impact  of  clutter  heterogeneity  on  detection  performance. 

1.  Introduction 


Target  detection  serves  as  the  primary  goal  of  radar.  Moving  target  indication  (MTI)  is  a common  radar 
mission  involving  the  detection  of  airborne  or  surface  moving  targets.  The  signal-to-noise  ratio  (SNR)  - a 
characterization  of  the  noise-limited  performance  of  the  radar  against  a target  with  radar  cross  section  aT  at 
range  r - is  approximated  as 
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where  P,  is  peak  transmit  power,  G, {</>,&)  is  antenna  gain,  Ae  is  the  effective  receive  aperture  area,  Njn  is  the 
input  noise  power,  F„  is  the  receiver  noise  figure  and  represents  radio  frequency  (RF)  system  losses  [1], 

Assuming  the  noise  is  uncorrelated  (white)  and  Gaussian,  the  probability  of  detection  is  a one-to-one  function 
of  both  SNR  and  probability  of  false  alarm.  It  is  also  important  to  point  out  that  by  maximizing  SNR , the 
processor  maximizes  probability  of  detection.  In  light  of  (1),  by  increasing  power-aperture  P,Ae,  the  radar 
designer  ensures  detection  of  targets  with  diminishing  radar  cross  section  at  farther  range.  System  constraints 
and  cost  limit  the  deployable  power-aperture  product. 

In  addition  to  noise-limited  constraints,  the  aerospace  radar  system  design  must  accommodate  the  impact  of 
ground  clutter  and  jamming  on  moving  target  detection.  Clutter  and  jamming  represent  colored  noise;  unlike 
white  noise,  clutter  and  jamming  exhibit  a degree  of  correlation.  Either  individually,  or  collectively,  we  refer 
to  clutter  and  jamming  as  interference.  Interference  increases  the  amount  of  ambiguity  in  the  target  decision 
process.  In  other  words,  as  the  interference  increases,  it  becomes  more  difficult  to  decide  whether  a target  is 
present  in  a given  observation.  Analogous  to  the  white  noise  detection  scenario,  the  probability  of  detection 
depends  on  both  signal-to-interference-plus-noise  ratio  (SINR)  and  the  specified  false  alarm  rate.  Since 
SINR  < SNR , interference  always  degrades  detection  performance  in  comparison  with  the  noise-limited  case. 

Signal  diversity,  in  the  form  of  spatial  and  temporal  degrees  of  freedom  (DoF),  greatly  enhances  radar 
detection  in  the  presence  of  colored  noise.  Specifically,  the  appropriate  application  of  space-time  DoFs 
maximizes  signal-to-interference-plus  noise  ratio  (SINR)  when  the  target  competes  with  ground  clutter  and 
barrage  jamming.  Clutter  exhibits  correlation  in  both  spatial  and  temporal  dimensions,  while  jamming  is 
predominantly  correlated  in  angle  for  modest  bandwidth.  Space-time  adaptive  processing  (STAP)  involves 
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adaptively  (or  dynamically)  adjusting  the  two-dimensional  filter  response  in  an  attempt  at  maximizing  the 
filter’s  output  SINR,  and  consequently,  improving  radar  detection  performance. 

The  objective  of  this  set  of  lecture  notes  is  to  develop  the  basic  theory  of  space-time  detection.  Six  sections 
comprise  this  paper.  In  the  next  section,  we  begin  by  considering  basic  elements  of  hypothesis  testing  and 
detection  theory.  The  important  notions  of  matched  filtering  and  whitening  arise  from  this  discussion. 
Subsequently,  in  Section  3 we  describe  the  importance  of  signal  diversity  in  advanced  radar  sensor  design. 
Radar  offers  signal  diversity  in  the  following  dimensions:  spatial,  slow-time  (or  Doppler),  fast-time  (or  range), 
polarization  and  multi-scan.  An  appropriate  selection  of  these  available  degrees  of  freedom  (DoF)  is  necessary 
for  effective  performance  enhancement.  Herein,  our  interest  lies  in  the  space  and  slow-time  dimensions.  (In  a 
companion  paper  [2],  we  describe  the  simultaneous  application  of  space  and  fast-time  DoFs  to  mitigate  terrain 
scattered  jamming.)  After  discussing  signal  diversity,  we  develop  several  typical  STAP  performance  metrics 
in  Section  4;  the  metrics  enable  comparison  of  competing  STAP  techniques  and  non-adaptive  processing 
schemes.  Section  5 investigates  several  practical  STAP  implementation  structures.  STAP  is  a data  domain 
implementation  of  an  optimum  filter;  practical  implementation  issues  include  estimating  unknown  quantities, 
alleviating  computational  burden  and  minimizing  training  data  requirements.  Another  issue  of  practical 
concern  is  clutter  heterogeneity.  Clutter  heterogeneity  degrades  STAP  performance  by  exacerbating  the 
estimation  the  interference  covariance  matrix,  an  essential  STAP  component.  Thus,  we  devote  the  final 
section  of  the  paper  - Section  6 - to  issues  centering  on  covariance  estimation  errors  resulting  from  clutter 
heterogeneity. 

The  following  is  a list  of  the  more  common  variables  used  in  this  paper: 

Pu  = probability  of  detection; 

PFA  = probability  of  false  alarm; 
px  (x)  = probability  density  function  of X ; 

77,77'=  detection  threshold; 

N = number  of  slow-time  pulses; 

M = number  of  spatial  channels; 

azimuth  and  elevation  angles; 
fd  = Doppler  frequency; 
s,(^,0)  = spatial  steering  vector; 

\\((f>,6)  = surrogate  spatial  steering  vector; 
s,„  ( <t>,0,fd ) = space-time  steering  vector; 
y,.x(<p,6,fd)  = surrogate  space-time  steering  vector; 

xUi  e C“‘  = spatial  snapshot,  k‘h  range  realization 
xk  e CmM  = space-time  snapshot,  k,h  range  realization; 
yk  = (scalar)  filter  output; 
wk  = space-time  weight  vector; 

Rk  = space-time  covariance  matrix; 

Rk,wk  = estimated  covariance  matrix  and  estimated  weight  vector. 


In  general,  a boldface,  lowercase  variable  indicates  a vector  quantity;  a boldface,  upper  case  variable  indicates 
a matrix;  and,  a variable  with  a caret  is  an  estimate.  Additionally,  an  optimum  filter  implies  clairvoyant 
knowledge  of  the  statistics  of  the  signal  environment.  Primary  data,  or  test  cells,  are  those  ranges  the 
processor  tests  for  target  presence.  Secondary  data  and  training  data  are  synonymous,  and  indicate  those  range 
cells  used  to  estimate  the  unknown  characteristics  of  the  primary  data.  Superscripts  “T”  or  “H”  applied  to  a 
vector  or  matrix  denote  the  transpose  or  hermitian  (conjugate  transpose)  operations.  The  notation 
a~CN(p,,RJ  indicates  that  a is  complex  normal  (Gaussian)  with  mean  p,  and  covariance  matrix  R, , and 
b~N(pb,Rb)  implies  b is  normally  distributed  with  mean  pb  and  covariance  matrix  Rb. 
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2.  Basic  Detection  Theory 

The  primary  objective  of  a radar  is  to  detect  a target  at  a given  range;  in  fact,  radar  is  an  acronym  for  “radio 
detection  and  ranging.”  This  section  of  the  paper  describes  the  basics  of  radar  detection.  Additionally,  we 
unify  detection  theory  and  predominant  notions  of  adaptive  filtering.  Excellent  sources  of  further  information 
on  radar  detection  include  [3-6].  The  whitening  operation,  matched  filtering,  and  SINR  are  the  key  elements 
of  this  unification. 

Clutter,  jamming  and  receiver  noise  are  random  variables  evolving  in  time.  For  instance,  we  can  never 
precisely  predict  the  clutter  voltage.  Rather,  we  can  describe  the  statistical  distribution  of  clutter,  jamming  and 
thermal  receiver  noise.  Similarly,  the  target  amplitude  is  a random  variable;  the  famous  Swerling  models  are 
usually  used  to  describe  target  amplitude  fluctuation  [1],  Due  to  the  random  nature  of  the  radar  signal 
environment,  ambiguity  as  to  whether  or  not  a target  is  present  is  always  a concern.  The  goal  of  detection 
theory  is  to  provide  a rational  procedure  for  confidently  determining,  in  a statistical  sense,  which  of  two 
possible 'models  - target  present  or  target  absent  - generated  a given  observation.  Hypothesis  testing  is  the 
cornerstone  of  detection  theory. 

Prior  to  our  hypothesis  testing  discussion,  we  briefly  provide  some  germane  background  information.  The 
probability  density  function  (pdf),  given  as  px  (x) , describes  the  relative  frequency  behavior  of  random 
variable  X . The  joint  pdf  between  X and  Y is  pXJ{x,y).  If  we  consider  n trials,  and  then  observe  specific 
occurrences  of  X - x a total  of  nx  times,  and  the  simultaneous  occurrence  of  X = x and  Y -y  a total  of  nw 
times,  then  px ( x ) ~nx  In  and  pXY (x,y)  ~ nxy  / n ; similarly,  we  find  pY (y)  ~nY  In  for  random  variable  Y . 
In  hypothesis  testing,  the  conditional  probability  density  is  most  significant.  Thus,  we  ponder  the  probability 
associated  with  observing  Y given  that  we  have  already  observed  certain  behavior  in  X . We  may  then  define 
the  conditional  probability  density  as  pYlx(y\x)  = pXY(x,y)/px(x)*=(nXY/n)/(nx/n).  If  X and  Y are 

statistically  independent,  then  pXY(x,y)  = px(x)pY(y) , since  pnx(y\x)  = pY(y)  as  a result  of  the  fact  that 
knowledge  of  X has  no  bearing  on  our  estimating  the  probability  that  Y -y . 

Radar  detection  is  a binary  hypothesis  testing  application.  The  Neyman-Pearson  criteria  (NPC)  are  most 
commonly  used  to  implement  this  hypothesis  testing  procedure  since  no  a priori  probabilities  are  necessary  in 
the  decision  mechanism.  In  radar,  the  NPC  aids  in  deciding  between  the  two  hypotheses:  H0  and  //, . H0  is 
known  as  the  null  hypothesis  and  corresponds  to  the  case  of  target  absence,  while  Ht  is  the  alternative 
hypothesis  and  indicates  target  presence.  The  NPC  nomenclature  includes  the  probability  of  detection,  PD, 
and  the  probability  of  false  alarm,  PFA : 

PD  = Pr[choose  H,  | Ht  is  true]  = J px]H  (x  | //,  )dx; 

T'  (2) 

PFA  = Pr [choose  H,  | H0  is  true]  = J pm (x  | H0)dx, 

T' 

where  7J  defines  the  “target  present”  decision  region.  The  goal  of  the  NPC  is  then  to  maximize  PD  while 
maintaining  PFA<y,  where  y is  the  maximum  false  alarm  rate  the  radar  system  and  associated  data  processor 
can  tolerate. 

Figure  1 shows  the  relationship  between  PD , PFA  and  the  threshold  setting  t| . Observe  a fundamental  truth 
from  this  figure:  lowering  the  threshold  to  increase  PD  necessarily  increases  PFA , and  viceversa.  The  regions 
of  overlap  among  the  two  pdfs  signifies  the  ambiguity  in  the  decision-making  process.  The  NPC  detector 
provides  an  optimum  mechanism  for  determining  target  presence  or  absence  in  light  of  this  ambiguity. 

Applying  the  NPC  in  an  optimization  procedure  (see  [34]  for  details)  leads  to  a decision  mechanism 
known  as  the  likelihood  ratio  test  (LRT),  which  is  given  by 
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A(x)=Pa>E>>I1.  (3) 

P,l„,(x|H„)< 

“o 

The  LRT  is  the  detector!  It  compares  the  ratio  of  the  two  likelihood  functions,  px!Hn  (x  |Hni)m=01 , to  the 

threshold  setting;  if  the  value  exceeds  the  threshold,  the  processor  declares  target  presence,  while  if  the  ratio  is 
less  than  the  threshold  setting,  the  processor  assumes  the  null  hypothesis. 


Figure  1.  Relationship  between  Pu,  PFA  and  threshold  setting,  T|. 

It  is  commonplace  to  apply  a monotonic  operation  to  simplify  the  LRT.  For  example,  the  natural  logarithm 
is  often  applied  to  reduce  expressions  involving  exponential  functions.  A sufficient  statistic  results  after 
manipulating  the  LRT  into  a canonical  form  involving  a function  of  the  observed  data.  We  denote  the 
sufficient  statistic  as 


'P(x)>t1'.  (4) 

< 

H° 

We  obtain  the  modified  threshold  in  (4)  by  deriving  the  pdf  of  H'(x)  under  the  null  hypothesis  condition  and 
then  solving 


~ j^P'F(x)IH„(^/(X)l^o)dX-7 


for  T]'.  As  one  might  expect,  we  ascertain  detection  performance  by  calculating  the  pdf  of  'F(x)  under  the 
alternative  hypothesis  case. 

Application  of  the  LRT  is  best  understood  through  example.  For  this  reason,  let’s  consider  a colored 
Gaussian  noise  (CGN)  scenario.  Our  hypotheses  are  given  as 

//, : x = Sj  + n (6) 

for  i = 0,1 , where  x is  the  vector  observation,  s,  is  the  target  signal,  s0  = 0 , and  n is  the  interference  vector. 
Also,  suppose  x represents  a coherent  dwell  of  N pulses,  spaced  by  the  pulse  repetition  interval,  Tp , i.e., 
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The  joint  pdf  of  (7)  is 


x = [*(0)  x(Tp)  - x((N-l)Tp)J. 


(7) 


= (8) 

Assuming  that  n is  distributed  according  to  a complex  Gaussian  distribution  with  zero  mean  and  covariance 
matrix  Rn  = E[nnH] , then 

^(n)=]~|exp(-n"R>)-  (9) 


Forming  the  LRT  using  (8)-(9),  then  taking  the  natural  log  and  simplifying  yields 


Re(sfR‘!x)- 


H , 

S,  > 1 , . . 

- — ln(?7) . 
< 2 


(10) 


The  first  term  on  the  left-hand  side  of  (10)  is  a whitening  filter,  R.1'2 , while  Rnl  2s,  is  a linearly  transformed 
matched  filter.  The  matched  filter  maximizes  SNR  (in  the  noise-limited  case);  hence,  in  the  presence  of 
colored-noise,  the  detector  first  whitens  (decorrelates)  the  colored-noise,  and  then  applies  a matched  filter  as  in 
the  noise-limited  case.  The  whitening  filter  and  matched  filter  will  be  discussed  in  further  detail  in  subsequent 
sections. 

It  then  follows  from  (10)  that  a sufficient  statistic  is 

", 

¥(x)  = Re(sfR 'P(x)  - A(Re(sfR-,sl),sfR;1s1).  (11) 

«0 

The  threshold  and  performance  of  the  detector  in  (1 1)  are  then  given  by 


ln(/7)-t-sfR^s, 

' Pn~0 

ln(77)-s1HR;'s1 

l ) 

l J 

(12) 


where  Q(x)  = -j=  fe  “ /2da  is  the  right-tail  distribution  (complementary  cumulative  distribution)  of  a 
V2jc  xj 

Gaussian  variate  with  zero  mean  and  variance  of  unity.  Numerical  tables  provide  values  of  Q(x)  and  Q'1  (x) 
[3].  We  then  use  the  expression  for  PFA  and  Q~l  (x)  to  calculate  the  threshold  r| . 

Figure  2 shows  an  upper  and  lower  bound  on  Q(x) . The  important  observation  from  (12)  and  Figure  2 


concerns  the  role  of  the  factor  sf  Rn's, ; this  term  represents  the  SINR.  As  sf  Rn'sj  increases,  the  argument 
in  (12)  decreases  for  the  PD  calculation  and  so  detection  performance  improves.  In  effect,  the  argument  in 


(12)  for  PFA  increases,  and  so  the  false  alarm  rate  decreases  below  the  specification  unless  the  threshold  is 
increased.  An  increase  in  SINR  serves  to  separate  the  pdfs  for  null  and  hypothesis  conditions,  making  it 
easier  to  distinguish  target  absence  from  target  presence.  Figure  3 clarifies  this  point  by  comparing  the 
histograms  for  null  and  alternative  hypothesis  with  three  different  values  of  signal-to-noise  ratio.  We  plot  the 
amplitude  of  the  complex  observations;  in  the  null  hypothesis  case  this  yields  a Rayleigh  distribution,  while 
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for  the  alternative  hypothesis  we  have  the  well-known  Rician  distribution  [6].  Adequate  SNR  or  S1NR  is 
essential  to  acceptable  detection  performance. 

The  appropriate  exploitation  of  signal  diversity  can  be  used  in  the  Gaussian  case  to  maximize  SINR,  hence 
yielding  maximal  detection  performance  for  any  specified  false  alarm  rate.  Perhaps  the  simplest  example  of 
the  benefits  of  signal  diversity  involves  sidelobe  noise  jamming.  For  a single  channel  system,  the  jamming 
enters  the  sidelobes  of  the  antenna  and  masks  target  detection  over  all  Doppler.  However,  since  the  noise 
jammer  is  spatially  correlated,  spatial  sampling  of  the  jammer  waveform  using  multiple  spatial  channels 
provides  information  on  the  jammer  direction  of  arrival.  This  information  can  then  be  used  to  construct  a 
whitening  filter  to  mitigate  all  signals  coming  from  the  jamming  angle.  Effectively,  the  detector  implements  a 
contrained  matched  filtering  operation,  where  the  constraint  places  a filter  null  in  the  jammer  direction.  The 
consequent  increase  in  SINR  leads  to  greatly  improved  detection  performance,  as  our  prior  discussion 
indicates. 

Ground  clutter  exhibits  a two-dimensional  correlation  in  angle  and  slow-time.  The  seminal  result  on  space- 
-time  detection  theory  for  radar  application  is  given  by  Brennan  and  Reed  in  [5],  Using  the  hypothesis  testing 
theory  of  the  prior  discussion,  we  have  the  mathematical  framework  to  develop  Brennan  and  Reed’s  result. 
Herein,  we  elect  to  synopsize  the  key  aspects  of  this  development. 

Consider  the  hypotheses 


H „ 


*k/H, 


— xM  + x^;  //, 


*k/H, 


— sT  + x^,. 


(13) 


where  x^,,  € CNMxl  is  the  space-time  snapshot  for  N pulses  and  M spatial  channels  for  the  kth  range 

realization  and  hypothesis  Hm.  Additionally,  xkJHi  - CN  (0,Rk) , where  Rk  is  the  null-hypothesis  covariance 

matrix,  sTeCNMxl  is  the  space-time  signal  vector,  and  xk/leCNMxl  and  xweCNllhl  are  space-time 
interference  and  noise  snapshots.  Each  space-time  snapshot  is  organized  as 


xk  =[XL(0)  xLo;)  •••  xL((^-l)7;)]re  (14) 


where  x^fnje  CMxl  is  the  spatial  snapshot  for  the  n*  pulse.  The  detector  involves  forming  a function  of  the 
filter  output,  yt  = wkxt , where  wke  CNMxl  is  the  space-time  weight  vector.  The  pdf  for  the  two  hypothesis 
of  (13)  are 


p(yt'n,)  = ~l 7exP 


y’klH, 


2 A 


, / = 0,1; 


with  the  mean  and  interference-plus-noise  variance  given  by 

(0  for  //0 
for//, 


cry=E 


Uy  {w"sT 


Rkwt. 


(15) 


(16) 


Upon  forming  the  LRT  and  algebraically  manipulating,  we  find  a sufficient  statistic  taking  the  simple  form 


h, 


«0 


(17) 
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The  pfd’s  of  (17)  for  null  and  alternative  hypotheses  are  Rayleigh  and  Rician,  respectively  [5],  Armed  with 
the  distributions,  we  can  then  determine  the  threshold  setting  and  detection  performance.  The  threshold  is 
calculated  from 


n 1 1 

f -Wf) 

Vi 

V 

•C3 

o 

II 

a> 

X 

T3 

{ 2<  ) 

(18) 


Detection  performance  is  given  by 


=&#(<*»£)=  Jvexl  p 

fi 


-(v2  +cr) 


I0(av)dv ; where, 


J 


SINR„ 


a=  wfsT 


, and  fi- 


JL 

G„ 


(19) 


Figure  2.  Upper  and  lower  bounds  on  the  complementary  cumulative  distribution  for  a Gaussian  variate. 


Figure  3.  Comparison  of  noise-only  histogram  with  signal-plus-noise  histograms  for  varying  SNR. 
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/„  (•)  is  the  modified  Bessel  function  of  the  first  kind  and  order  zero.  QM  (a,  (3)  is  known  as  Marcum’s  Q and 
is  a monotonically  increasing  function  of  the  square-root  of  SINR  and  monotonically  decreasing  function  of 
normalized  threshold  setting,  (3 ; numerical  techniques  are  required  to  solve  for  PD  in  (19). 

The  receiver  operating  characteristic  (ROC)  characteristic  of  the  space-time  detector  in  (17)  is  shown  in 
Figure  4.  The  different  curves  shown  in  the  figure  correspond  to  different  false  alarm  rates.  Observe  that  as 
the  false  alarm  rate  increases  - implying  a decrease  in  the  threshold  - the  detection  probability  increases. 


Receiver  Operating  Characteristic  (ROC) 


Figure  4.  Receive  operating  characteristic  for  space-time  detector. 

3.  Signal  Diversity 

Pulsed  radar  sensors  can  potentially  measure  angle  (azimuth,  elevation  or  cone  angle),  Doppler  (slow-time), 
range  (fast-time),  polarization,  and  the  multi-scan  behavior  of  target  and  interference  behavior.  Different 
classes  of  interference  exhibit  varying  correlation  over  the  available  sensor  measurement  spaces.  Table  1 
describes  different  interference  types  and  their  respective  correlation  properties. 


Table  1.  Correlation  Characteristics  of  Some  Inter: 

erence  Types 

t - Ui.'i  interference  Type  (V 

Correlation  Characteristics  - 

if  ihVc- Comments  , : - 

Ground  Clutter 

Azimuth  (or  cone  angle)  and  Doppler, 
polarization  response  may  differ  from 
clutter. 

Space  and  slow-time  adaptive  processing 
(STAP)  can  mitigate  clutter,  polarization 
may  aid  the  detection  of  very  slow  targets. 

Narrowband  Noise  Jamming 

Angle  (azimuth  and_elevation). 

Spatial  null  suppresses  jamming  signal. 

Wideband  Noise  Jamming 

Angle  and  fast-time. 

Variation  with  wavelength  must  be 
compensated  with  fast-time  processing, 
also  helps  with  channel  mismatch. 

Hot  Clutter 

Angle  and  fast-time. 

Jammer  waveform  often  correlated  in  fast- 
time. 

Cold  and  Hot  Clutter 

Angle,  slow-time  and  fast-time. 

Requires  3-D  STAP. 

The  ability  to  separate  interference  and  target  characteristics  in  the  chosen  measurement  spaces  enables  the 
processor  to  enhance  the  target  while  mitigating  the  interference.  Inherently,  the  processor  takes  advantage  of 
the  differences  in  correlation  properties  to  maximize  SINR  in  the  multidimensional  detection  space,  thereby 
maximizing  detection  performance.  Figure  5 provides  a notional  view  of  the  impact  of  signal  diversity  on 
detection  performance.  In  this  figure,  X , Y and  Z are  Fourier  transforms  of  measurement  domains  x , y 
and  z ; hence,  Figure  5 is  a three-dimensional  power  spectral  density  (PSD).  The  inverse  Fourier  transform  of 
the  PSD  yields  the  correlation  function.  Notice  the  separability  of  the  target  from  the  interference  in  the  three- 
dimensional  space.  However,  projecting  the  data  into  any  two-dimensional  data  space,  we  observe  the 
masking  of  the  target  by  the  interference.  When  the  target  shares  the  same  space  as  the  interference,  filtering 
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Figure  5.  Notional  view  of  the  importance  of  signal  diversity  on  target  detection.  In  three  dimensions,  the 
target  and  interference  are  separable,  and  hence  more  detectable,  while  in  two  dimensions  the  interference 

masks  the  target. 

The  impact  of  ground  clutter  on  detection  performance  serves  as  our  primary  focus.  For  this  reason,  the 
remainder  of  the  paper  considers  the  space  (azimuth  or  cone  angle)  and  slow-time  (Doppler)  dimensions. 
Figure  6 shows  the  PSD  for  simulated  ground  clutter;  observe  the  coupling  in  angle  and  Doppler  marked  by 
those  two-dimensional  frequencies  with  larger  clutter  power  values.  The  dashed  line  denotes  the  transmit 
direction.  The  processor  searches  for  targets  along  this  line  at  all  Doppler.  Mainlobe  clutter  - located  near 
zero  Doppler  and  zero  angle  - impedes  the  detection  of  slow-moving  targets.  Endo-clutter  detection  refers  to 
detection  of  low  radial  velocity  (low  Doppler)  targets  existing  in  the  diffraction-limited  clutter  regions. 
Additionally,  sidfelobe  clutter  - present  at  those  Doppler  frequencies  away  from  zero  Doppler  - masks  the 
detection  of  fester  moving,  exo-clutter  targets.  Figure  7 shows  the  corresponding  space-time  filter  response. 
The  filter  places  a null  on  the  clutter  ridge.  This  matched  two-dimensional  response  leads  to  maximal  SINR. 


Angle-Doppier  Power  Spectral  Density  dB 


Angle  (degrees) 

Figure  6.  Power  spectral  density  for  ground  clutter. 


has  little  or  no  impact  on  SINR.  In  the  three-dimensional  space,  the  detection  processor  can  construct  a filter 
to  maximize  the  gain  on  the  target  while  nulling  the  interference. 


interference 


x-y  projection 
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"OPTIMUM  FILTER  RESPONSE  dB 


Angle  (Degrees) 

Figure  7.  Space-time  optimal  filter  response. 

Signal  diversity  is  an  essential  component  in  enhancing  the  detection  of  slow  moving  and/or  weak  targets 
competing  with  ground  clutter  returns.  Choosing  other  measurement  spaces,  with  perhaps  the  exception  of 
polarization,  will  not  help  improve  performance.  As  can  be  seen  from  Figures  6-7,  it  would  be  very  difficult, 
if  not  impossible,  to  detect  a target  at  zero  angle  and  zero  Doppler.  Yet,  since  STAP  is  a member  of  the  class 
of  super-resolution  algorithms,  the  processor  can  detect  targets  very  close  to  mainbeam  clutter. 

4.  Performance  Metrics 

We  briefly  consider  several  of  the  more  useful  performance  metrics  used  to  characterize  STAP 
performance.  We  begin  by  describing  the  difference  between  adaptive  and  optimal  processors. 

STAP  is  a data  domain  implementation  of  an  optimal  filter.  In  developing  the  space-time  detector  in  (13)- 
(19),  we  pointed  out  that  by  maximizing  SINR,  the  processor  maximizes  PD . The  results  in  these  equations  are 
valid  for  any  weight  vector  wk . We  are  interested  in  the  weight  vector  leading  to  maximal  SINR,  which  is 
given  as 


wk=pRk's,.,(()),e,fd),  (20) 

where  p.  is  an  arbitrary  constant  and  s,.,(<{>,0,fd)  is  the  target  space-time*  steering  vector.  The  space-time 
steering  vector  is 

s^,  (<|>,  8,  fd ) = s,  (fd ) ® s,  «>,  6),  (21) 


with  s,(fd)e  CNxl  representing  the  temporal  (Doppler)  steering  vector  pointing  to  Doppler  frequency  fd,  and 
s,(<|>.0)e  CMxl  denoting  the  spatial  steering  vector  for  a signal  at  azimuth  <{>  and  elevation  0.  The  space-time 
steering  vector  is  the  normalized  response  of  the  radar  sensor  to  a target  with  a specified  angle  of  arrival  and 
Doppler  frequency,  i.e.,  sT  = ass.t(<)>,0,fd)  for  complex  amplitude  a. 

We  now  consider  a short  proof  on  the  optimality  of  (20).  Observe  that 


SINR  = 


_ E[yJi]  Ps 


*[y, 


(22) 
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where  ys  is  the  signal-only  filter  output,  yI+N  is  the  null  hypothesis  filter  output,  and  Ps  and  Pl+N  represent 
output  signal  power  and  null  hypothesis  output  power.  Additionally,  we  have  removed  the  space-time  steering 
vector’s  dependence  on  angle  and  Doppler  for  notational  convenience.  Next,  let  wk  = Az  and  s=Awss_, , 
where  A = Rk1/2 . Substituting  these  expressions  into  (22)  yields 


pi+N  = < Rk = z" AwRk Az  = z"z;  Ps  = E [ \a\ 


z"s 


Inserting  (23)  into  (22)  gives 


(23) 


(24) 


Using  the  Schwarz  Inequality,  we  find 


zHs'  = 


IK  [*i 


<zHzsH$. 


(25) 


Notice  that  an  optimal  value  for  z is  zop,  =jus ; with  this  selection,  the  SINR  is  maximal.  Hence, 


=)tAAHss.t  =pRk'ss.„ 


(26) 


which  is  the  result  we  desire. 

Several  comments  are  in  order.  First  of  all,  A = Rk''2  is  a whitening  filter  since  E[Axkxk  AH]  = INM  (i.e., 
the  whitening  filter  decorrelates  the  colored  noise  input,  hence  making  it  look  like  white  noise).  In  the  white 
noise  case,  s5_,  is  the  matched  filter;  it  has  a bandpass  response  with  angle  and  Doppler  and  maximizes  SNR. 
We  may  thus  interpret  the  optimal  weight  vector  as 


Wk/op«  — (Rk  )(Rk  Ss-t)’ 

Whitening  Warped 
Filter  Matched  Filter 


(27) 


The  warped  matched  filter  accommodates  the  linear  transformation  applied  to  the  space-time  signal  vector 
during  the  whitening  stage.  Finally,  keep  in  mind  that  SINR  is  a function  of  both  angle  and  Doppler. 

In  the  STAP  case,  both  the  covariance  matrix  and  target  space-time  steering  vector  are  unknown.  For  this 
reason,  the  processor  must  estimate  both  quantities.  To  handle  the  unknown  steering  vector,  the  processor 
sweeps  across  a series  of  pre-calculated  space-time  steering  vectors,  v,.t(<|),8,fd-) , covering  several  angle  bins 
about  the  transmit  direction  and  the  whole  Doppler  space.  Performance  degradation  resulting  from  steering 
vector  mismatch  is  generally  slight.  Covariance  estimation  is  more  complicated.  Commonly,  we  use  the 
estimate 


„ 1 K 
Rk=— Yx„x" 
k K, 


(28) 


where  xm  are  training  (secondary  data)  taken  from  ranges  adjacent  to  the  test  (primary)  cell  [7],  The  adaptive 
weight  vector  is 


wk=pRkIv,.t(<l>,e,fd), 


(29) 
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where  P is  a constant  typically  set  to  p = 1 or  P = ^v(H,(<j>,0,fd)Rk  'v,.,(Wd)- 

If  the  secondary  data  are  multivariate  Gaussian  and  independent  and  identically  distributed  (iid)  with 
respect  to  the  null-hypothesis  of  the  primary  data,  (28)  is  a maximum  likelihood  estimate.  Selecting  the 
number  K of  secondary  data  to  obtain  a suitable  estimate  via  the  calculation  of  (28)  is  addressed  in  [7]. 
Specifically,  Reed,  Mallett  and  Brennan  have  shown  that 

E[(SINR|wk)/(SINR0plimum)]=.(K  + 2-NM)/(K  + l) . (30) 

The  famous  Reed-Mallett-Brennan  (RMB)  Rule  then  states  that  for  the  adaptive  processor  to  attain  an  SINR 
loss  of  -3  dB  in  comparison  with  the  optimal  processor  requires  K = 2NM-3 . The  RMB  Rule  assumes  the 
secondary  data  are  iid. 

Since  a one-to-one  correspondence  exists  between  SINR  and  PD  in  the  Gaussian  case,  SINR  loss  factors 
are  very  convenient  and  commonly  used  metrics  of  STAP  performance  [8].  We  may  write  the  output  SINR  as 

SINR(fc  9,  fd ) = SNR(4>,  0)  x Lsl  (<)>,  0,  fd ) x Ls , (<)>,  0,  fd ); 

0 - Ls  i (<)>,  0,  fd ),  Ls , (<|),  0,  fd ) < 1 . (31) 


SNR(<|),0)  follows  from  (1).  L5l(<|>,0,fd)  and  Ls 2 (<j>, 0, fd ) are  SINR  loss  factors:  Lsl(<|>,0,fd)  characterizes 

the  loss  in  signal-to-noise  ratio  due  to  colored  noise,  while  Li2(<|>,0,fd)  measures  the  loss  in  SINR  resulting 

from  the  error  between  optimal  (known)  and  adaptive  (estimated)  weight  vectors.  Ls , (<f>, 0, fd ) captures  the 

impact  of  both  the  system  design  and  signal  processing  algorithm  selection  on  performance  in  interference- 
limited  environments.  Definitions  of  both  SINR  loss  factors  are 


_ SINR(<j).0,fd)|^  ^ f SINR(4>, 0, fd ) |w  Adaptive  Output  SINR 

SNR(<{>,0)  ’ ’ d “ SINR((j),  0, fd ) ~ Optimal  Output  SINR  ' 


Substituting  (32)  into  (31)  yields  the  identity  SINR(<J),0,fd)  = SlNR((J),0,fd)|ik . 

Figure  8 shows  the  Ls  l term  for  the  optimal  space-time  processor  (STP)  and  the  conventional  digital 

beamformer  (DBF)  followed  by  Hanning-weighted  Doppler  processing  (DOP).  This  example  corresponds  to 
the  same  interference  environment  shown  in  Figure  6,  except  the  dwell  has  been  doubled.  The  poor 
performance  of  the  conventional  approach  is  a consequence  of  the  short  dwell  and  small  aperture  used  in  this 
example.  However,  the  example  does  highlight  the  tremendous  performance  potential  of  STAP  (in  an  iid 
environment,  the  STAP  SINR  loss  curve  appears  similar  to  the  optimal  STP  L5 , curve  of  Figure  8,  with  a 
downward  shift  of  a few  decibels).  The  0 dB  line  in  Figure  8 corresponds  to  noise-limited  performance.  From 
the  SINR  loss  curves,  we  derive  another  very  important  STAP  metric:  the  minimum  detectable  velocity 
(MDV).  The  MDV  follows  from  a specification  of  the  maximum  tolerable  SINR  loss  if  the  system  is  to  meet 
detection  performance  requirements.  If,  for  example,  5 dB  is  the  tolerable  loss,  the  radar  system  MDV  in  our 
example  is  ±(A./2)x(60  Hz)=±9  m/s  for  the  optimal  STP,  where  X = 0.3  m is  the  RF  wavelength. 

The  optimal  and  adaptive  filter  responses  are  also  useful  metrics.  Given  a weight  vector,  we  compute  the 
filter  output  response  by  sweeping  across  all  space-time  steering  vectors  of  interest,  viz. 


(4>. 8. fd ) = | wX (<!>, 0, fd )|2 ; H^W, 0,fd)  = |wX (4>, 8, fd )f 


(33) 


An  example  of  the  optimal  response  is  given  in  Figure  7;  the  filter  is  tuned  to  broadside  and  250  Hz.  Doppler. 
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SINR  Loss 


5.  Space-Time  Detector  Structures 


In  this  section  we  begin  by  briefly  considering  some  normalizations  applied  in  STAP-type  detectors.  Then, 
we  discuss  the  concept  of  interference  rank,  examine  several  reduced-rank  or  reduced-dimension  processing 
schemes,  and  consider  the  role  of  diagonal  loading  in  STAP  implementation. 

Recall,  in  (20)  and  (29)  the  optimal  or  adaptive  weight  vector  includes  arbitrary  scalars  p and  |5, 
respectively.  While  a particular  selection  of  either  scalar  does  not  affect  SINR,  certain  selections  are 
convenient  in  subsequent  processing.  A commonly  used  detector  and  normalization  is  given  by 


w"x 

"k  Ak 


'x 

k Ak 


;-t*H  Ss-t 


Hi 

> 

• i 
< 
H0 


Vi 


(34) 


where  p = l/^s”,Rk'ss_,  in  this  case  [9].  Let  wk  =Rk's,.t(<(),0,fd),  such  that  wk=pwk.  Then,  1/p2  equals 
the  output  noise  power  of  the  filter  with  weight  vector  wk . In  the  unknown  covariance  case,  we  substitute  the 
form  in  (28)  into  (34),  yielding  the  modified  sample  matrix  inversion  (MSMI)  detection  statistic.  In  this  case, 
(34)  becomes 


h, 

* ) ; *k  = K\-< > (35) 

Ho 

where  we  further  calculate  t),  for  the  single  channel,  single  pulse  case  with  the  noise  variance  set  to  unity. 

The  term  l/(32  =sf.(Rk's5.,  modifies  the  threshold  to  account  for  varying  interference  power  levels  (and 
integration  gain).  As  shown  in  [9],  this  normalization  leads  to  a constant  false  alarm  rate  (CFAR)  property. 
As  the  noise  power  increases,  the  threshold  increases  to  maintain  CFAR,  and  vice  versa. 

Kelly  formulated  the  generalized  likelihood  ratio  test  (GLRT)  of  the  form 


W?*k  = 


Ss-(lvk  Ak 


H, 

>. 


sH  R"‘s 
“s-t^k  Vt 


f 1 „ - , 

lH  xk  Rk  xk 


K 


Kn2, 


< 

Ho 


(36) 
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where  K is  the  number  of  secondary  data  samples  [10].  This  detector  also  possesses  the  CFAR  property.  It 
differs  in  form  from  (34)  as  a result  of  different  modeling  assumptions:  in  the  GLRT,  the  detector  assumes  the 
covariance  matrix  is  unknown.  The  derivation  of  (36)  generally  follows  the  same  LRT  development  described 
in  the  initial  sections  of  this  report.  Observe  the  detector  can  be  written 


A data  dependent  threshold  modifies  the  MSMI  threshold  to  account  for  the  finite  number  of  training  data.  As 
K becomes  large,  the  test  statistic  approaches  the  MSMI  result. 

Figure  9 provides  a block  diagram  of  the  overall  process.  Essentially,  a STAP  filter  suppresses 
interference.  The  scalar  filter  output  is  multiplied  by  a data-dependent  “threshold  multiplier”  - either  1/a2  or 
1/a2  multiplied  by  the  factor  in  brackets  on  the  far  right-hand  side  of  (37)  - and  then  compared  to  a fixed 
threshold. 


Figure  9.  Embedded  CFAR  in  STAP  processing. 

Other  normalizations  are  possible.  For  instance,  the  minimum  variance  beamformer  chooses 
pMV  = l/s‘‘,R['s,.t  [11],  This  normalization  accordingly  affects  the  interpretation  of  the  data. 

STAP  is  effectively  a catch-all  for  a class  of  linear  filters  involving  SINR  maximization  of  the  input  data 
[8].  We  can  broadly  characterize  the  class  of  STAP  methods  into  two  sub-groups:  reduced-dimension  (RD) 
STAP  and  reduced-rank  (RR)  STAP.  RD-STAP  involves  data-independent  transformations  and  bin  selection, 
whilst  RR-STAP  is  a data-dependent  approach.  We’ll  consider  these  two  sub-groups  in  turn.  The  ultimate 
goals  of  either  RD-STAP  or  RR-STAP  include  reducing  computational  burden  and  reducing  required  sample 
support. 

Figure  10  is  an  overview  of  RD-STAP  methods.  The  processor  can  transform  the  space-time  data  via  the 
data  independent  transformation 


xk  = THxk  e CPQl1,  (38) 

where  T is  the  transformation  matrix  and  ideally  PQ«NM.  Common  transformations  include 
beamforming  and  Doppler  processing  steps.  By  analogy  to  our  prior  discussion,  the  required  optimal  RD 
weight  vector  yielding  maximum  SINR  in  the  reduced  space  is 

wk=(THRkT)_1THs,,.  (39) 

In  practice,  the  RD-STAP  weight  vector  is  given  by 


A A A I K 

wk=Rk'v;  Rk=-7lvli  v=T  v,,, 

^ m=1 


(40) 
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where  K'«K  indicates  a reduction  in  required  secondary  data.  Also,  RD-STAP  computational  burden  is 
0(P3Q3) , which  typically  is  far  less  than  for  the  space-time  case  with  computational  load  0(NJM3) . 


Spatial  Filtering 


Doppler 

Filtering 


Figure  10.  Overview  of  reduced-dimension  STAP  methods. 


Examples  of  RD-STAP  techniques  are  available  in  [8,  12-13].  The  Factored  Time-Space  (FTS)  algorithm 
is  a post-Doppler  method  suitable  for  long  coherent  dwells  and  higher  radial  velocity  targets.  The  FTS  method 
essentially  involves  spatial  notching  of  the  clutter  in  a given  Doppler  filter.  Since  FTS  provides  no  temporal 
adaptivity,  it  is  not  a true  STAP  algorithm.  To  enhance  performance  with  only  modest  increase  in  required 
sample  support  and  computational  burden,  DiPietro  proposed  the  Extended  Factored  Algorithm  (EFA)  [12]. 
The  EFA  method  involves  adaptively  combining  several  adjacent  Doppler  filters  (typically  three)  and  all 
spatial  channels.  The  EFA  method  often  exhibits  performance  very  close  to  the  theoretical  joint-domain  (JD) 
space-time  bound.  To  provide  diversity  in  spatial  and  temporal  DoF,  Wang  developed  the  Joint  Domain 
Localized  (JDL)  technique  [13].  JDL  is  a post-Doppler,  beamspace  method.  Basically,  the  processor  forms 
multiple  beams,  then  Doppler  processes  each  beam,  and  finally  selects  a collection  of  adjacent  angle-Doppler 
bins  over  which  to  adapt  the  filter  response.  JDL  provides  good  performance  with  very  low  training  data 
requirements  and  very  modest  computational  burden.  Three  adjacent  beams  by  three  adjacent  Doppler  bins  is 
a typical  localized  JDL  processing  regions  [13].  Figure  11  shows  the  SINR  loss  for  each  of  the  RD-STAP 
methods  for  the  same  example  shown  in  Figure  8;  we  compare  performance  against  the  upper  bound  given  by 
the  JD  space-time  filter.  Each  RD-STAP  method  uses  secondary  data  support  of  twice  the  processor’s  DoF. 
Note,  our  discussion  did  not  include  an  example  of  a joint  pre-Doppler  and  angle  method. 

Reduced-rank  STAP  methods  involve  data-dependent  transformations  and  selections  [14-15].  Consider  the 
following  formulation  of  the  optimal  weight  vector 


wt 


(**,/„)- 


AW  Xm  - Xa 


amqm 


(41) 


where  Xm  is  an  eigenvalue  of  Rk  with  eigenvector  qm  , X0  =min(A,m)  and  aro  is  the  projection  of  the  mth 
eigenvector  onto  the  quiescent  response  given  by  s5_t . In  the  adaptive  case,  we  substitute  Rk  for  Rk . Thus, 
we  can  view  STAP  as  a pattern  synthesis  problem:  based  on  a sensing  of  the  environment,  the  processor  places 
notches  in  the  quiescent  pattern  to  mitigate  interference.  Notice  from  (41)  that  no  subtraction  occurs  for 
eigenvalues  at  the  noise  floor.  The  principal  components  (PC)  method  involves  truncating  the  summation  in 
(41)  to  accommodate  only  the  largest  eigenvalue  terms.  A benefit  of  this  approach  is  a reduction  in  training 
data  support.  However,  computational  burden  remains  high,  since  the  processor  must  compute  eigenvalues 
and  eigenvectors.  An  alternative,  yet  similar,  approach  to  the  PC  method  is  given  in  [15]  and  is  known  as  the 
Principal  Components  Inverse  (PCI)  method.  Specifically,  the  processor  computes 
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< =(INM  -Q,ft,Q?)xk=xk  - (42) 

m=l  A 

m 

where  Q,  is  the  collection  of  interference  eigenvectors  (corresponding  to  each  column),  ft,  is  the  matrix 
enabling  perfect  subtraction  of  the  designated  interference  term  in  xk , al!m  is  the  projection  of  the  m,h 
eigenvector  onto  the  data  snapshot,  and  A,!m  is  a non-zero  eigenvalue  of  the  interference-only  eigenvector.  In 
practice,  we  cannot  estimate  the  interference-only  eigenvector,  and  so  for  strong,  low-rank  interference  we  can 
set  &iim  = ^m,  or  Q,  = INM(:,1 : P)  for  the  P dominant  components.  Since 

Q,Qf+QX=lNM,  (43) 

where  QN  is  the  collection  of  noise  eigenvectors  (corresponding  to  each  column),  we  can  express  (42)  as 

< = QnQ?V  (44) 

Equation  (44)  indicates  that  PCI  can  operate  by  using  the  noise  subspace  information.  This  stands  in  contrast 
to  the  PC  method  using  the  dominant  subspace.  Using  our  familiar  example  found  in  Figure  8 and  Figure  11, 
Figure  12  compares  the  SINR  loss  for  the  PC  and  PCI  methods  against  the  upper  bound  defined  by  the  JD 
STAP  solution.  The  adaptive  solution  involves  a secondary  data  set  of  one  times  the  joint  domain  processor’s 
total  DoF,  i.e.  K = NM . The  interference  subspace  has  dimension  40,  and  so  we  use  all  40  principal 
components  for  the  PC  method  and  all  3 12  noise  subspace  components  for  the  PCI  approach.  Armed  with  this 
information,  observe  that  both  methods  very  closely  follow  the  optimal  performance.  In  practice,  choosing  the 
correct  number  of  components  is  challenging  and  an  incorrect  selection  can  lead  to  performance  degradation. 


SINR  Loss  Comparison  for  RD-STAP  Methods 


Oftentimes,  the  covariance  estimate  is  ill-conditioned  or  exhibits  a perturbed  noise  subspace.  In  such  cases, 
diagonal  loading  can  be  used  to  tailor  performance  [16].  As  the  name  implies,  diagonal  loading  involves 
adding  an  identity  matrix  to  the  covariance  estimate  to  stabilize  the  noise  floor  and  condition  the  covariance 
matrix  estimate  for  subsequent  numerical  processing.  From  (41)  we  find  that  if  the  noise  eigenvalues  are 
perturbed,  the  processor  subtracts  a noise  eigenvector  from  the  quiescent  response.  Noise  eigenvectors  exhibit 
random  sidelobe  behavior.  Hence,  the  adaptive  filter  response  for  low  secondary  data  set  sizes  shows  elevated 
sidelobes.  High  sidelobes  can  be  problematic,  leading  for  instance  to  sidelobe  target  detection.  Diagonal 
loading  cures  this  problem. 
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SINR  Loss  Comparison  for  RR-STAP  Methods 
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Figure  12.  SINR  loss  for  RR-STAP  methods. 


6.  Heterogeneous  Clutter  Effects 

Adaptive  processing  requires  secondary  (training)  data  to  estimate  unknown  quantities.  For  example, 
traditional,  CFAR  circuits  [6],  which  involve  scalar  adaptive  processing,  typically  select  ten  range  bins  on  each 
side  of  the  test  and  guard  cells  to  estimate  the  ambient  interference  power.  STAP  similarly  requires  a training 
interval  to  estimate  the  null  hypothesis  covariance  matrix.  However,  in  contrast  with  the  scalar  CFAR  case, 
the  potentially  high  dimensionality  of  STAP  requires  training  data  selection  over  large  regions.  Suppose  a 
STAP-based  radar  has  N = 32  (thirty-two  pulse  dwell)  and  M = 1 1 (eleven  spatial  channels).  The  minimum 
training  set  is  then  on  the  order  of  2NM  = 704  range  bins.  If  the  instantaneous  bandwidth  is  1 MHz,  training 
data  comes  from  a region  extending  over  10  km!  Over  the  10  km  interval,  the  changing  cultural  features  of  the 
clutter  environment  lead  to  range-angle  variation  of  ground  clutter  returns.  This  range-angle  variation  in  then 
ground  clutter  return  is  known  as  clutter  heterogeneity  [17-18], 

Common  STAP  training  approaches  include  the  following  methods:  block  selection,  windowed  block 
selection  and  sliding  window.  In  the  block  selection  approach,  the  processor  selects  training  data  sequentially 
from  a given  region  and  then  applies  it  to  a distinct  primary  data  region.  The  windowed  block  selection  is  a 
variant  of  the  block  selection  technique;  the  training  data  is  chosen  sequentially  about  the  primary  data  region, 
with  each  half  of  the  training  data  coming  from  opposite  sides  of  the  primary  data  regions.  In  the  sliding 
window  approach,  the  processor  attempts  to  localize  training  data  within  the  vicinity  of  the  primary  data. 
Thus,  the  processor  chooses  training  data  symmetrically  about  the  test  data  and  adjacent  guard  cells.  The 
sliding  window  approach  is  most  computationally  intense,  but  numerical  techniques  are  available  to  alleviate 
the  computational  burden  of  both  covariance  estimation  and  covariance  matrix  inversion. 

As  a result  of  clutter  heterogeneity,  the  precise  yet  unknown  covariance  matrix  of  a given  range  cell  does 
not  match  other  ranges.  Hence,  heterogeneous  clutter  environments  are  no  longer  iid.  The  non-iid 
environment  gives  rise  to  increased  covariance  estimation  errors  when  using  (28).  Specifically,  since 
Rp*Rm  in  non-iid  environments,  where  p and  m indicate  the  training  vector  indices  and  p # m , it  stands  to 

reason  that  ' 


E[Rk]  = E|j  Zxmx»l  = l h[v!]=|  IR 

L J m=]  K m=I  L J K m=1 


(45) 


In  other  words,  in  heterogeneous  clutter  environments,  the  covariance  matrix  estimate  tends  to  an  average 
characteristic  of  the  training  data.  For  this  reason,  instantaneous  performance  can  be  poor. 

Clutter  heterogeneity  manifests  as  a modification  of  certain  components  and  the  addition  of  additional 
terms.  The  angle  and  Doppler  behavior  of  ground  clutter  is  well  defined  and  results  from  sensor  geometry  and 
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the  platform  velocity  vector  [8].  However,  clutter  amplitude  and  spectral  width  can  vary,  and  target-like 
signals  lead  to  extra  components.  Tabic  2 synopsizes  the  various  classes  of  clutter  heterogeneity  and  their 
impact  on  STAP  performance. 

The  covariance  estimation  errors  due  to  clutter  heterogeneity  lead  to  increased  SINR  loss,  i.e.  Ls , further 

approaches  zero.  Clutter  heterogeneity  is  a challenge  to  STAP  implementation,  since  adaptive  and  optimal 
performance  does  not  necessarily  converge  w'ith  increasing  number  of  training  vectors.  This  realization  is 
driving  new  STAP  approaches  holding  the  potential  for  robust  performance  in  light  of  cluttter  heterogeneity 
and  other  behavior  (see  the  last  entry  in  Table  2)  leading  to  non-iid  training  data. 


Table  2.  Taxonomy  of  Clutter  Heterogeneity 


Heterogeneity  Type 

Cause  r; 

Impact  on  Adaptive  Radar 

Amplitude 

Shadowing  and  obscuration,  range-angle 
dependent  change  in  clutter  reflectivity,  strong 
stationary  discretes,  sea  spikes,  urban  centers, 
land-sea  interfaces,  etc. 

Null  depth  depends  on  eigenvalue  ratio  - 
MLE  “averaging”  leads  to  underestimated 
eigenvalue  magnitude,  and  consequently, 
uncancelled  clutter  and  increased  false  alarm 
rate. 

Spectral 

Intrinsic  clutter  motion  due  to  soft  scatterers 
(trees,  windblown  fields,  etc.),  ocean  waves; 
CNR-dcpcndent  spectral  mismatch. 

Null  width  set  to  mean  spread  - too  narrow 
for  some  range  cells,  thereby  leading  to 
uncancelled  clutter,  seriously  degraded 
MDV. 

Moving  Scatterers 

Ground  traffic,  weather,  insects  and  birds,  air 
vehicles. 

Mainlobe  nulling,  false  sidelobe  target 
declarations,  distorted  beam  patterns, 
exhausts  DoF. 

Some  Other  Effects 

• 

Chaff,  hot  clutter,  multi-bounce/ multipath, 
impact  of  platform  geometry  (eg.,  non- 
sidelooking or  bistatic)  on  angie-Dopplcr 
behavior  over  range. 

Combination  of  above  effects. 

7.  Summary 

In  this  set  of  lecture  notes,  we  consider  fundamental  aspects  of  space-time  adaptive  detection  theory. 
Section  2 describes  the  basics  of  hypothesis  testing  and  relates  this  information  to  space-time  detection.  We 
introduce  the  likelihood  ratio  test  in  this  section  and  through  example  demonstrate  its  use.  Additionally,  we 
describe  the  important  whitening  filter  and  matched  filter  concepts,  relating  them  to  the  detection  statistic.  We 
then  describe  the  application  of  signal  diversity  to  advanced  sensor  design  in  Section  3.  Signal  diversity  helps 
cull  the  weak  target  signal  masked  by  interference.  We  provide  an  example  showing  the  space-time 
characteristics  of  ground  clutter  and  the  advantage  space-time  degrees  of  freedom  offer  in  enhancing  detection 
performance.  Section  4 first  develops  the  maximum  signal -to-interference-plus-noise  ratio  (SINR)  filter, 
which  maximizes  the  probability  of  detection  in  the  Gaussian  case.  We  then  describe  commonly  used  STAP 
performance  metrics.  We  overview  space-time  detector  structures  in  Section  5.  In  this  sectioiFwe  first  begin 
by  examining  STAP  normalizations  leading  to  a constant  false  alarm  rate  (CFAR)  property.  We  then  describe 
the  philosophy  behind  reduced-dimension  and  reduced-rank  STAP  methods.  Reduced-dimension  STAP 
involves  data  independent  transformations  to  characterize  interference  behavior  in  a smaller  subspace. 
Advantages  of  reduced-dimension  STAP  include  a requirement  for  smaller  training  data  sets  and  minimal 
computational  burden.  Reduced-rank  techniques  involve  data  dependent  transformations  to  derive  a stochastic 
basis  describing  the  interference  properties.  The  principal  component  reduced-rank  technique  can  achieve  the 
upper  bound  on  SINR  performance  when  properly  applied.  We  also  introduce  the  principal  components 
inverse  method  in  this  section.  The  paper  concludes  with  a brief  discussion  of  clutter  heterogeneity.  We 
define  clutter  heterogeneity  and  describe  its  impact  on  STAP  implementation. 
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